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FOCUSED LIBRARIES OF GENETIC PACKAGES 

This application claims the benefit under 35 USC 
§ 120 of United States provisional application 60/256,380, 
filed December 18, 2001. The provisional application and 
5 the Tables attached to it are specifically incorporated by 
reference herein. 

The present invention relates to focused 
libraries of genetic packages that each display, display 
and express, or comprise a member of a diverse family of 

10 peptides, polypeptides or proteins and collectively 
display, display and express, or comprise at least a 
portion of the focused diversity of the family. The 
focused diversity of the libraries of this invention 
comprises both sequence diversity and length diversity. In 

15 a preferred embodiment, the focused diversity of the 

libraries of this invention is biased toward the natural 
diversity of the selected family. In a more preferred 
embodiment, the libraries are biased toward the natural 
diversity of human antibodies and are characterized by 

20 variegation in their heavy chain and light chain 
complementarity determining regions ("CDRs") . 

The present invention further relates to vectors 
and genetic packages (e.g., cells, spores or viruses) for 
displaying, or displaying and expressing a focused diverse 

25 family of peptides, polypeptides or proteins. In a 
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preferred embodiment the genetic packages are filamentous 
phage or phagemids or yeast. Again, the focused diversity 
of the family comprises diversity in sequence and diversity 
in length. 

5 The present invention further relates to methods 

of screening the focused libraries of the invention and to 
the peptides, polypeptides and proteins identified by such 
screening. 

BACKGROUND OF THE INVENTION 

10 it is now common practice in the art to prepare 

libraries of genetic packages that individually display, 
display and express, or comprise a member of a diverse 
family of peptides, polypeptides or proteins and 
collectively display, display and express, or comprise at 

15 least a portion of the amino acid diversity of the family. 
In many common libraries, the peptides, polypeptides or 
proteins are related to antibodies (e.g., single chain Fv 
(scFv) , Fv, Fab, whole antibodies or minibodies (i.e., 
dimers that consist of V H linked to V L ) ) . Often, they 

20 comprise one or more of the CDRs and framework regions of 
the heavy and light chains of human antibodies. 

Peptide, polypeptide or protein libraries have 
been produced in several ways in the prior art. See e.g., 
Knappik etal., J. Mol. Biol., 296, pp. 57-86 (2000), which 

25 is incorporated herein by references. One method is to 
capture the diversity of native donors, either naive or 
immunized. Another way is to generate libraries having 
synthetic diversity. A third method is a combination of 
the first two. Typically, the diversity produced by these 

30 methods is limited to sequence diversity, i.e., each member 
of the library differs from the other members of the family 
by having different amino acids or variegation at a given 
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position in the peptide, polypeptide or protein chain- 
Naturally diverse peptides, polypeptides or proteins, 
however, are not limited to diversity only in their amino 
acid sequences. For example, human antibodies are not 

5 limited to sequence diversity in their amino acids, they 
are also diverse in the lengths of their amino acid chains. 

For antibodies, diversity in length occurs, for 
example, during variable region rearrangements. See e.g., 
Corbett et al., J. Mol. Biol., 270, pp. 587-97 (1997). The 

10 joining of V genes to J genes, for example, results in the 
inclusion of a recognizable D segment in CDR3 in about half 
of the heavy chain antibody sequences, thus creating 
regions encoding varying lengths of amino acids . The 
following also may occur during joining of antibody gene 

15 segments: (i) the end of the V gene may have zero to 

several bases deleted or changed; (ii) the end of the D 
segment may have zero to many bases removed or changed; 
(iii) a number of random bases may be inserted between V 
and D or between D and J; and (iv) the 5' end of J may be 

20 edited to remove or to change several bases. These 

rearrangements result in antibodies that are diverse both 
in amino acid sequence and in length. 

Libraries that contain only amino acid sequence 
diversity are, thus, disadvantaged in that they do not 

25 reflect the natural diversity of the peptide, polypeptide 
or protein that the library is intended to mimic. Further, 
diversity in length may be important to the ultimate 
functioning of the protein, peptide or polypeptide. For 
example, with regard to a library comprising antibody 

30 regions, many of the peptides, polypeptides, proteins 
displayed, displayed and expressed, or comprised by the 
genetic packages of the library may not fold properly or 
their binding to an antigen may be disadvantaged, if 
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diversity both in sequence and length are not represented 
in the library. 

An additional disadvantage of prior art libraries 
of genetic packages that display, display and express, or 
5 comprise peptides, polypeptides and proteins is that they 
are not focused on those members that are based on natural 
occurring diversity and thus on members that are most 
likely to be functional. Rather, the prior art libraries, 
typically, attempt to include as much diversity or 

10 variegation at every amino acid residue as possible. This 
makes library construction time-consuming and less 
efficient than possible. The large number of members that 
are produced by trying to capture complete diversity also 
makes screening more cumbersome than it needs to be. This 

15 is particularly true given that many members of the library 
will not be functional. 

SUMMARY OF THE INVENTION 

One objective of this invention is focused 
libraries of vectors or genetic packages that encode 

20 members of a diverse family of peptides, polypeptides or 
proteins wherein the libraries encode populations that are 
diverse in both length and sequence. The diverse length 
comprising components that contain motifs that are likely 
to fold and function in the context of the parental 

25 peptide, polypeptide or protein. 

Another object of this invention is focused 
libraries of genetic packages that display, display and 
express, or comprise a member of a diverse family of 
peptides, polypeptides and proteins and collectively 

30 display, display and express, or comprise at least a 
portion of the focused diversity of the family. These 
libraries are diverse not only in their amino acid 
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sequences , but also in their lengths. And, their diversity 
is focused so as to more closely mimic or take into account 
the naturally-occurring diversity of the specific family 
that the library represents. 
5 Another object of this invention is diverse, but 

focused, populations of DNA sequences encoding peptides, 
polypeptides or proteins suitable for display or display 
and expression using genetic packages {such as phage or 
phagemids) or other regimens that allow selection of 

10 specific binding components of a library. 

A further object of this invention is focused 
libraries comprising the CDRs of human antibodies that are 
diverse in both their amino acid sequence and in their 
length (examples of such libraries include libraries of 

15 single chain Fv (scFv) , Fv, Fab, whole antibodies or 

minibodies (i.e., dimers that consist of V H linked to V L ) ) . 
Such regions may be from the heavy or light chains or both 
and may include one or more of the CDRs of those chains. 
More preferably, the diversity or variegation occurs in all 

20 of the heavy chain and light chain CDRs. 

It is another object of this invention to provide 
methods of making and screening the above libraries and the 
peptides, polypeptides and proteins obtained in such 
screening. 

25 Among the preferred embodiments of this invention 

are the following: 

1. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 

30 peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
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variegated DNA sequences that encode a heavy chain CDR1 
selected from the group consisting of: 

(1) <1> 1 Y 2 <1> 3 M 4 <1> 5/ wherein <1> is an 
equimolar mixture of each of amino acid residues A, D, E, 

5 F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y; 

(2 ) (S/T) x (S/G/X) 2 (S/G/X) 3 Y 4 Y 5 W 6 (S/G/X) 7 . 
wherein (S/T) is a 1:1 mixture of S and T residues, (S/G/X) 
is a mixture of 0,2025 S, 0.2025 G and 0.035 of each of 
amino acid residues A, D, E, F, H, I, K, L, M, N, P, Q, R, 

10 T, V, W, and Y; 

(3) V 1 S 2 G 3 G 4 S 5 I 6 S 7 <l> 8 <l> 9 <l> 1 oY n Y 12 W 13 <l> 14 , 
wherein <1> is an equimolar mixture of each of amino acid 
residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 
W, and Y; and 

15 (4) mixtures of vectors or genetic packages 

characterized by any of the above DNA sequences, preferably 
in the ratio: HC CDRls (1) : (2) : ( 3) : : 0 . 80 : 0 . 17 : 0 . 02 . 

2. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 

20 member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody facility, the 
vectors or genetic packages being characterized by 

25 variegated DNA sequences that encode a heavy chain CDR2 
selected from the group consisting of: 

(1) <2>K2X3>SGG<1>T<1>YADSVKG, wherein 
<1> is an equimolar mixture of each of amino acid residues 
A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and 

30 Y; <2> is an equimolar mixture of each of amino acid 
residues Y, R, W, V, G, and S; and <3> is an equimolar 
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mixture of each of amino acid residues P, S, and G or an 
equimolar mixture of P and S; 

(2) <l>I<4><lXlXGX5XlXlXl>YADSVKG, 
wherein <1> is an equimolar mixture of each of amino acid 

5 residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 
W, and Y; <4> is an equimolar mixture of residues D, I, N, 
S, W, Y; and <5> is an equimolar mixture of residues S, G, 
D and N; 

(3) <l>I<4XlXl>G<5XlXl>YNPSLKG, wherein 
10 <1> is an equimolar mixture of each of amino acid residues 

A, D, E, F, G f H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 
and <4> and <5> are as defined above; 

(4) <1>K8>S<1X1X1>GGYY<1>YAASVKG / 
wherein <1> is an equimolar mixture of each amino acid 

15 residues A, D, E, F, G, H, I, K, L, M r N, P, Q, R, S, T, V, 
W and Y; <8> is 0.27 R and 0,027 of each of 
ADEFGHIKLMNPQSTVWY; and 

(5) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences , preferably 

20 in the ratio: HC CDR2s: (l)/(2) (equimolar): 
(3) : (4) : : 0 . 54 : 0 . 43 : 0 . 03 . 

3. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 

25 peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a heavy chain CDR3 

30 selected from the group consisting of: 

(1) YYCA2 1 1 1 1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 
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G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(2) YYCA2 11111 1 YFD YWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

5 G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(3) YYCA2 11111 111YFDYWG, wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
10 equimolar mixture of K and R; 

(4) YYCAR111S2S3111YFDYWG, wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of S and G; and 3 is an equimolar mixture 
15 of Y and W; 

(5) YYCA2 1 1 1CSG1 1CY1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 
20 (6) YYCA211S1TIFG11111YFDYWG, wherein 1 is 

an equimolar mixture of each amino acid residues A, D, E, 
F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is 
an equimolar mixture of K and R; 

(7) YYCAR1 1 1YY2 S 3 3 4 4 1 1 1 YFD YWG , wherein 1 is 
25 an equimolar mixture of each amino acid residues A, D, E, 

F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 2 is an 
equimolar mixture of D and G; and 3 is an equimolar mixture 
of S and G; 

(8) YYCAR1 1 1 1 YC2 2 3 1CY1 1 1 YFDYWG , wherein 1 
3 0 is an equimolar mixture of each amino acid residues A, D, 

E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 2 is 
an equimolar mixture of S and G; and 3 is an equimolar 
mixture of T, D and G; and 

RECTIFIED SHEET (RULE 91) 
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(9) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences, preferably 
the HC CDR3s (1) through (8) are in the following 
proportions in the mixture: 
5 (1) 0,10 

(2) 0.14 

(3) 0.25 

(4) 0.13 

(5) 0.13 

10 (6) 0.11 

(7) 0.04 and 

(8) 0.10; and more preferably the HC CDR3s 
(1) through (8) are in the following proportions in the 
mixture: 

15 (1) 0.02 

(2) 0.14 

(3) 0.25 

(4) 0.14 

(5) 0.14 

20 (6) 0.12 

(7) 0.08 and 

(8) 0.11. 

Preferably, 1 in one or all of HC CDR3s (1) 
through (8) is 0.095 of each of G and Y and 0.048 of each 
25 of A, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, and W. 

4. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
30 display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
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variegated DNA sequences that encodes a kappa light chain 
CDR1 selected from the group consisting of: 

( 1 ) RASQ<1>V<2X2X3>LA 

( 2 ) RASQ< 1 >V<2 >< 2 ><2 X 3 >LA ; 

5 wherein <1> is an equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRST VWY ; <2> is 0,2 S and 0.044 of each of 
ADEFGHIKLMNPQRTVWY; and <3> is 0.2Y and 0.044 each of 
ADE FGH I KLMN PQRT VW and Y; and 

(3) mixtures of vectors or genetic packages 
10 characterized by any of the above DNA sequences,, preferably 

in the ratio CDRls (1) : (2) : : 0 . 68 : 0 . 32 . 

5. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a kappa light chain 
CDR2 having the sequence: 

<1>AS<2>R<4><1>, 
wherein <1> is an equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRST VWY; <2> is 0.2 S and 0.04 4 of each of 
ADEFGHIKLMNPQRTVWY; and <4> is 0.2 A and ) 0.044 each of 
DE FGH IKLMN PQRST VWY . 

6. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 

30 display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
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vectors or genetic packages being characterized by 
variegated DNA sequences that encode a kappa light chain 
CDR3 selected from the groups consisting of: 

(1) QQ<3XlXlXl>P<l>T f 

5 wherein <1> is an equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRS T VWY ; <3> is 0.2 Y and 0.044 each of 
ADE FGHIKLMN PQRTVW ; 

(2) QQ33111P, wherein 1 and 3 are as 
defined in (1) above; 

10 (3) QQ3211PPlT f wherein 1 and 3 are as 

defined in (1) above and 2 is 0.2 S and 0.044 each of 
ADE FGH I KLMN PQRT VW Y ; and 

(4) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences, preferably 

15 in the ratio CDR3s (1) : (2) : (3) : : 0 . 65 : 0 . 1 : 0 . 25. 

7. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 

20 display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 
CDR1 selected from the group consisting of: 

25 (1) TG<1>SS<2>VG<1><3X2><3>VS, 

wherein <1> is 0.27 T, 0.27 G and 0.027 each of 
ADEFHIKLMNPQRSVWY, <2> is 0.27 D, 0.27 N and 0.027 each of 
AEFGHIKLMPQRSTVWY, and <3> is 0.36 Y and 0.036 each of 
ADEFGHIKLMNPQRSTVW; 

30 (2) G<2X4>L<4><4><4><3><4><4>, 
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wherein <2> is as defined in (1) above and <4> is an 
equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRST VWY ; and 

(3) mixtures of vectors or genetic packages 
5 characterized by any of the above DNA sequences, preferably 
in the ratio CDRls (1) : (2) : :0. 67:0.33. 

8. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 

10 peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 

15 CDR2 has the sequence: 

<4><4><4><2>RPS, 
wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 
AEFGHIKLMPQRSTVWY and <4> is an equimolar mixture of amino 
acid residues ADEFGHIKLMNPQRSTVW. 

20 9. A focused library of vectors or genetic 

packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 

25 portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 
CDR3 selected from the group consisting of: 

(1) <4><5><4><2><4>S<4><4><4X4>V, 

30 wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 

AEFGHIKLMPQRSTVWY; <4> is an equimolar mixture of amino 
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acid residues ADE FGH I KLMN PQRST VW ; and <5> is 0.36 S and 
0.0355 each of ADE FGH I KLMN PQRT VWY ; 

. (2) <5>SY<1X5>S<5X1X4>V, wherein <1> is 
an equimolar mixture of ADE FGH I KLMN PQR S T VW Y ; and <4> and 
5 <5> are as defined in (1) above; and 

(3) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences, preferably 
in the ratio CDR3s (1) : (2) : : 1:1. 

10. A focused library comprising variegated DNA 
10 sequences that encode a heavy chain CDR selected from the 
group consisting of: 

(1) one or more of the heavy chain CDRls of 
paragraph 1 above; 

(2) one or more of the heavy chain CDR2s of 
15 paragraph 2 above; 

(3) one or more of the heavy chain CDR3s of 
paragraph 3 above; and 

(4) mixtures of vectors or genetic packages 
characterized by (1), (2) and (3). 

20 11. The focused library comprising one or more 

of the variegated DNA sequences that encodes a heavy chain 
CDR of paragraphs 1, 2 and 3 and further comprising 
variegated DNA sequences that encodes a light chain CDR 
selected from the group consisting of 

25 (1) one or more the kappa light chain CDRls 

of paragraph 4; 

(2) the kappa light chain CDR2 of 

paragraph 5; 

(3) one or more of the kappa light chain 
30 CDR3s of paragraph 6; 



WO 02/061071 



PCT/US01/50297 



- 14 - 

(4) one or more of the kappa light chain 
CDRls of paragraph 7; 

(5) the lambda light chain CDR2 of 

paragraph 8; 

5 (6) one or more of the lambda light chain 

CDR3s of paragraph 9; and 

(7) mixtures of vectors and genetic 
packages characterized by one or more of (1) through (6) . 

12 . A population of variegated DNA sequences as 
10 described in paragraphs 1-11 above, 

13, A population of vectors comprising the 
variegated DNA sequences as described in paragraphs 1-11 
above . 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Antibodies ("Ab") concentrate their diversity 
into those regions that are involved, in determining 
affinity and specificity of the Ab for particular targets. 
These regions may be diverse in sequence or in length. 
Generally, they are diverse in both ways. However, within 
families of human antibodies the diversities, both in 
sequence and in length, are not truly random. Rather, some 
amino acid residues are preferred at certain positions of 
the CDRs and some CDR lengths are preferred. These 
preferred diversities account for the natural diversity of 
the antibody family. 

According to this invention, and as more fully 
described below, libraries of vectors and genetic packages 
that more closely mirror the natural diversity, both in 



15 



20 



25 



WO 02/061071 



PCT/USO 1/50297 



- 15 - 

sequence and in length, of antibody families, or portions 
thereof are prepared and used. 

Human Antibody Heavy Chain Sequence and Length Diversity 
( a ) Framework 

5 The heavy chain ("HC") Germ-Line Gene (GLG) 3-23 

(also known as .VP-47) accounts for about 12% of all human 
Abs and is preferred as the framework in the preferred 
embodiment of the invention. It .should, however, be 
understood that other well-known frameworks, such as 4-34, 
10 3-30, 3-30.3 and 4-30.1, may also be used without departing 
from the principles of the focused diversities of this 
invention. 

In addition, JH4 (YFDYWGQGTLVTUSS) occurs more 
often than JH3 in native antibodies. Hence, it is 
15 preferred for the focused libraries of this invention. 
However, JH3 (AFDIWGQGTMVTVSS) could as well be used. 

(b) Focused Length Diversity: CDR1, 2 and 3 
(i) CDR1 

For CDR1, GLGs provide CDRls only of the lengths 
20 5, 6, and 7. Mutations during the maturation of the V- 
domain gene, however, can lead to CDRls having lengths as 
short as 2 and as long as 16. Nevertheless, length 5 
predominates. Accordingly, in the preferred embodiment of 
this invention, the preferred HC CDRl is 5 amino acids, 
25 with less preferred CDRls having lengths of 7 and 14. In 
the most preferred libraries of this invention, all three 
lengths are used in proportions similar to those found in 
natural antibodies . 
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(ii) CDR2 

GLGs provide CDR2s only of the lengths 15-19, but 
mutations during maturation may result in CDR2s of lengths 
from 16 to 28 amino acids. The lengths 16 and 17 
5 predominate in mature Ab genes. Accordingly, length 17 is 
the preferred length for HC CDR2 of the present invention. 
Less preferred HC CDR2s of this invention have lengths 16 
and 19. In the most preferred focused libraries of this 
invention, all three lengths are included in proportions 
10 similar to those found in natural antibody families. 

(iii) CDR3 

HC CDR3s vary in length. About half of human HCs 
consist of the components: V: : nz : : D: : ny : : JHn where V is a V 
gene, nz is a series of bases (mean 12) that are 

15 essentially random, D is a D segment, often with heavy 

editing at both ends, ny is a series of bases (mean 6) that 
are essentially random, and JH is one of the six JH 
segments, often with heavy editing at the 5' end. The D 
segments appear to provide spacer segments that allow 

20 folding of the IgG. The greatest diversity is at the 
junctions of V with D and of D with JH. 

In the preferred libraries of this invention both 
types of HC CDR3s are used. In HC CDR3s that have no 
identifiable D segment, the structure is V::nz::JHn where 

25 JH is usually edited at the 5 1 end. In HC CDR3s that have 
an identifiable D segment, the structure is 
V: :nz: :D: :ny: : JHn. 
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(c) Focused Sequence Diversity: CDR1, 2 and 3 
(i) CDR1 

In 5 amino acid length CDR1, examination of a 3D 
model of a humanized Ab showed that the side groups of 
5 residues 1, 3, and 5 were directed toward the combining 
pocket. Consequently , in the focused libraries of this 
invention, each of these positions may be selected from any 
of the native amino acid residues, except cysteine ("C") . 
Cysteine can form disulfide bonds, which are an important 

10 component of the canonical Ig fold. Having free thiol 

groups could, thus, interfere with proper folding of the HC 
and could lead to problems in production or manipulation of 
selected Abs. Thus, in the focused libraries of this 
invention cysteine is excluded from positions 1, 3 and 5 of 

15 the preferred 5 amino acid CDRls. The other 19 natural 
amino acids residues may be used at positions 1, 3 and 5. 
Preferably, each is present in equimolar ratios in the 
variegated libraries of this invention, 

3D modeling also suggests that the side groups of 

20 residue 2 in a 5 amino acid CDR1 are directed away from the 
combining pocket. Although this position shows substantial 
diversity, both in GLG and mature genes, in the focused 
libraries of this invention this residue is preferably Tyr 
(Y) because it occurs in 681/820 mature antibody genes. 

25 However, any of the other native amino acid residues, 
except Cys (C) , could also be used at this position. 

For position 4, there is also some diversity in 
GLG and mature antibody genes. However, almost all mature 
genes have uncharged hydrophobic amino acid residues: A, G, 

30 L, P, F, M, W, I, V, at this position. Inspection of a 3D 
model also shows that the side group of residue 4 is packed 
into the innards of the HC. Thus, in the preferred 
embodiment of this invention which uses framework 3-23, 
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residue 4 is preferably Met because it is likely to fit 
very well into the framework of 3-23. With other 
frameworks, a similar fit consideration is used to assign 
residue 4. 

5 Thus, the most preferred HC CDRl of this 

invention consists of the amino acid sequence <1>Y<1>M<1> 
where <1> can be any one of amino acid residues: A, D, E, 

F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y (not C) , 
preferably present at each position in an equimolar amount. 

10 This diversity is shown in the context of a framework 3- 
23-.JH4 in Table 1. It has a diversity of 6859-fold. 

The two less preferred HC CDRls of this invention 
have length 7 and length 14. For length 7, a preferred 
variegation is (S/T) x (S/G/<1» 2 (S/G/<1» 3 Y 4 Y 5 W 6 (S/G/<1» 

15 where (S/T) indicates an equimolar mixture of Ser and Thr 
codons; (S/G/<1>) indicates a mixture of 0.2025 S, 0.2025 

G, and 0.035 for each of A, D, E, F, H, I, K, L, M, N, P, 
Q, R, T, V, W, Y. This design gives a predominance of Ser 
and Gly at positions 2, 3, and 7, as occurs in mature HC 

20 genes. For length 14, a preferred variegation is 

VSGGSIS<1X1><1>YYW<1>, where <1> is an equimolar mixture 
of the 19 native amino acid residues, except Cys (C) . 

The DNA that encodes these preferred HC CDRls is 
preferably synthesized using trinucleotide building blocks 

25 so that each amino acid residue is present in essentially 
equimolar or other described amounts. The preferred codons 
for the <1> amino acid residues are get, gat, gag, ttt, 
ggt, cat, att, aag, ctt, atg, aat, cct, cag, cgt, tct, act, 
gtt, tgg, and tat. Of course, other codons for the chosen 

30 amino acid residue could also be used. 

The diversity oligonucleotide (ON) is preferably 
synthesized from BspEI to BstXI (as shown in Table 1) and 
can, therefore, be incorporated either by PCR synthesis 
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using overlapping ONs or introduced by ligation of 
BspEI/BstXI-cut fragments. Table 2 shows the 
oligonucleotides that embody the specified variegations of 
the preferred length 5 HC CDRls of this invention. PCR 
5 using ON-RlVlvg, ON-Rltop, and ON-Rlbot gives a dsDNA 
product of 73 base pairs, cleavage with BspEI and SstXI 
trims 11 and 13 bases from the ends and provides cohesive 
ends that can be ligated to similarly cut vector having the 
3-23 domain shown in Table 1. Replacement of ON-RlVlvg 

10 with either ONRlV2vg or ONRlV3vg (see Table 2) allows 

synthesis of the two alternative diversity patterns — the 
7 residue length and the 14 residue length HC CDR1. 

The more preferred libraries of this invention 
comprise the 3 preferred HC CDR1 length diversities. Most 

15 preferably, the 3 lengths should be incorporated in 

approximately the ratios in which they are observed in 
antibodies selected without reference to the length of the 
CDRs. For example, one sample of 1095 HC genes have the 
three lengths present in the ratio: 

20 L=5:L=7:L=14: :820:175:23: :0. 80:0.17:0.02. This is the 
preferred ratio in accordance with this invention. 

(ii) CDR2 

Diversity in HC CDR2 was designed with the same 
considerations as for HC CDR1 : GLG sequences, mature 

25 sequences and 3D structure, A preferred length for CDR2 is 
17, as shown in Table 1. For this preferred 17 length 
CDR2, the preferred variegation in accordance with the 
invention is: <2>I<2><3>SGG<1>T<1>YADSVKG, where <2> 
indicates any amino acid residue selected from the group of 

30 Y r R, W, V, G and S (equimolar mixture) , <3> is P, S and G 
or P and S only (equimolar mixture) r and <1> is any native 
amino acid residue except C (equimolar mixture) . 
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ON-R2Vlvg shown in Table 3 embodies this 
diversity pattern. It is preferably synthesized so that 
fragments of dsDNA containing the BstXI and Xbal site can 
be generated by PCR. PCR with ON-R2Vlvg, ON-R2top,and 
5 ONR2bot gives a dsDNA product of 122 base pairs. Cleavage 
with BstXI and Xbal removes about 10 bases from each end 
and produces cohesive ends that can be ligated to similarly 
cut vector that contains the 3-23 gene shown in Table 1. 

In an alternative embodiment for a 17 length HC 

10 CDR2, the following variegation may be used: 

<1>I<4X1X1>G<5X1X1X1>YADSVKG, where <1> is as 
described above for the more preferred alternative of HC 
CDR2; <4> indicates an equimolar mixture of DINSWY, and <5> 
indicates an equimolar mixture of SGDN. This diversity 

15 pattern is embodied in ON-R2V2vg shown in Table 3. 

Preferably, the two embodiments are used in equimolar 
mixtures in the libraries of this invention. 

Other preferred HC CDR2s have lengths 16 and 19. 
Length 16: <1>K4X1><1>G<5<1><1>YNPSLKG; Length 19: 

20 <1>K8>S<1X1X1>GGYY<1>YAASVKG, wherein <1> is an 

equimolar mixture of all native amino acid residues except 
C; <4> is a equimolar mixture of DINSWY; <5> is an 
equimolar mixture of SGDN; and <8> is 0.27 R and 0.027 of 
each of residues ADEFGHIKLMNPQSTVWY. Table 3 shows ON- 

25 R2V3vg which embodies a preferred CDR2 variegation of 
length 16 and ON-R2V4vg which embodies a preferred CDR2 
variegation of length 19. To prepare these variegations 
ON-R2V3vg may be PCR amplified with ON-R2top and ON-R2bo3 
and ON-R2V4vg may be PCR amplified with ON-R2top and ON-R2- 

30 bo4. See Table 3. In the most preferred embodiment of 
this invention, all three HC CDR2 lengths are used. 
Preferably, they are present in a ratio 
17:16:19: : 57 9: 464: 31: : 0.54: 0.43: 0.03. 
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(iii) CDR3 

The preferred libraries of this invention 
comprise several HC CDR3 components. Some of these will 
have only sequence diversity. Others will have sequence 
5 diversity with embedded D segments to extend the length, 
while also incorporating sequences known to allow Igs to 
fold. The HC CDR3 components of the preferred libraries of 
this invention and their diversities are depicted in 
Table 4: Components 1-8. 

10 This set of components was chosen after studying 

the sequences of 1383 human HC sequences. The proposed 
components are meant to fulfill the following goals: 

1) approximately the same distribution of lengths 
as seen in native Ab genes; 

15 2) high level of sequence diversity at places 

having high diversity in native Ab genes; and 

3) incorporation of constant sequences often seen 
in native Ab genes. 

Component 1 represents all the genes having 

20 lengths 0 to 8 (counting from the YYCAR motif at the end of 
FR3 to the WG dipeptide motif near the start of the J 
region, i.e., FR4). Component 2 corresponds the all the 
genes having lengths 9 or 10. Component 3 corresponds to 
the genes having lengths 11 or 12 plus half the genes 

25 having length 13. Component 4 corresponds to those having 
length 14 plus half those having length 13. Component 5 
corresponds to the genes having length 15 and half of those 
having length 16. Component 6 corresponds to genes of 
length 17 plus half of those with length 16. Component 7 

30 corresponds to those with length 18. Component 8 

corresponds to those having length 19 and greater. See 
Table 4. 
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For each HC CDR3 residue having the diversity 
<1>, equimolar ratios are preferably not used. Rather, the 
following ratios are used 0.095 [G and Y] and 0.048 [A, D, 
E, F, H, I, K, L, M, N, P, Q, R, S, T, V, and W] . Thus, 
5 there is a double dose of G and Y with the other residues 
being in equimolar ratios. For the other diversities, 
e.g., KR or SG, the residues are present in equimolar 
mixtures. 

In the preferred libraries of this invention the 

10 eight components are present in the following fractions: 1 
(0.10), 2 (0.14), 3 (0.25), 4 (0.13), 5 (0.13), 6 (0.11), 7 
(0.04) and 8 (0.10). See Table 4. 

In the more preferred embodiment of this 
invention, the amounts of the eight components is adjusted 

15 because the first component is not complex enough to 

justify including it as 10% of the library. For example, 
if the final library were to have 1 x 10 9 members, then 
1 x 10 8 sequences would come from component 1, but it has 
only 2.6 x 10 5 CDR3 sequences so that each one would occur 

20 in -385 CDR1/2 contexts. Therefore, the more preferred 
amounts of the eight components are 1(0.02), 2(0.14), 
3(0.25), 4(0.14), 5(0.14), 6(0.12), 7(0.08), 8(0.11). In 
accordance with the more preferred embodiment component 1 
occurs in -77 CDR1/2 contexts and the other, longer CDR3s 

25 occur more often. 

Table 5 shows vgDNA that embodies each of the 
eight HC CDR3 components shown in Table 4. In Table 5, the 
oligonucleotides (ON) Ctop25, CtprmA, CBprmB, and CBot25 
allow PCR amplification of each of the variegated ONs 

30 (vgDNA): Clt08, C2tl0, C3tl2, C4tl4, C5tl5, C6tl7, C7tl8, 
and C8tl9. After amplification, the dsDNA can be cleaved 
with Aflll and BstEII (or Kpnl) and ligated to similarly 
cleaved vector that contains the remainder of the 3-23 
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domain. Preferably, this vector already contains diversity 
in one, or both, of CDR1 and CDR2 as disclosed herein. 
Most preferably, it contains diversity in both the CDR1 and 
CDR2 regions. It is, of course, to be understood that the 
5 various diversities can be incorporated into the vector in 
any order. 

Preferably, the recipient vector originally 
contains a stuff er in place of CDR1, CDR2 and CDR3 so that 
there will be no parental sequence that would then occur in 

10 the resulting library. Table 6 shows a version of the V3- 
23 gene segment with each CDR replaced by a short segment 
that contains both stop codons and restriction sites that 
will allow specific cleavage of any vector that does not 
have the stuff er removed. The stuff er can either be short 

15 and contain a restriction enzyme site that will not occur 
in the finished library, allowing removal of vectors that 
are not cleaved by both A fill and BstEII (or Kpnl) and 
religated. Alternatively, the stuffer could be 200-400 
bases long so that uncleaved or once cleaved vector can be 

20 readily separated from doubly cleaved vector. 

Human Antibody Light Chain: Sequence and Length Diversity 
(i) Kappa Chain 
(a) Framework 

In the preferred embodiment of this invention, 
25 the kappa light chain is built in an A27 framework with a 
JK1 region. These are the most common V and J regions in 
the native genes. Other frameworks, such as 012, L2, and 
All, and other J regions, such as JK4, however, may be used 
without departing from the scope of this invention. 
30 (b) CDR1 

In native human kappa chains, CDRls with lengths 
of 11, 12, 13, 16, and 17 were observed with length 11 
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being predominant and length 12 being well represented. 
Thus, in the preferred embodiments of this invention LC 
CDRls of length 11 and 12 are used in an and mixture 
similar to that observed in native antibodies) , length 11 
5 being most preferred. Length 11 has the following sequence: 
RASQ<1>V<2><2><3>LA and Length 12 has the following 
sequence: RASQ<1>V<2><2><2><3>LA, wherein <1> is an 
equimolar mixture of all of the native amino acid residues, 
except C, <2> is 0.2 S and 0.044 of each of 

10 ADE FGH I KLMN PQRT VWY , and <3> is 0.2 Y and 0.044 each of A, 
D, E, F, G, H, I, K, L, M, N, P, Q, R, T, V, W and Y. In 
the most preferred embodiment of this invention, both CDR1 
lengths are used. Preferably , they are present in a ratio 
of 11:12: :154:73: :0. 68:0. 32. 

15 (c) CDR2 

In native kappa, CDR2 exhibits only length 7. 
This length is used in the preferred embodiments of this 
invention. It has the sequence <1>AS<2>R<4><1>, wherein <1> 
is an equimolar mixture of amino acid residues 

20 ADE FGH I KLMN PQRST VW Y ; <2> is 0 . 2 S and 0.004 of each of 

ADE FGH I KLMN PQ RT VW Y ; and <4> is 0.2 A and 0.044 of each of 
DEFGHIKLMNPQRSTCJWY. 

(d) CDR3 

In native kappa, CDR3 exhibits lengths of 1, 4, 
25 6, 7, 8, 9, "10, 11, 12, 13, and 19. While any of these 
lengths and mixtures of them can be employed in this 
invention, we prefer lengths 8, 9 and 10, length 9 being 
more preferred. For the preferred Length 9, the sequence is 
QQ<3><l><lxl>P<l>T, wherein <1> is an equimolar mixture of 
30 amino acid residues ADE FGH I KLMN PQRS T VWY and <3> is 0.2 Y 
and 0.044 each of ADE FGH I KLMN PQRS VW. Length 8 is 
preferably QQ33111P and Length 10 is preferably QQ3211PP1T, 
wherein 1 and 3 are as defined for Length 9 and 2 is S 
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(0.2) and 0.044 each of ADEFGHIKLMNPQRTVWY. A mixture of 
all 3 lengths being most preferred (ratios as in native 
antibodies), i.e., 8 : 9 : 10 : : 28 : 166: 63 : : 0 . 1 : 0 . 65 : 0 . 25 . 

Table 7 shows a kappa chain gene of this 
invention, including a PlacZ promoter, a ribosome-binding 
site, and signal seguence (M13 III signal) . The DNA 
sequence encodes the GLG amino acid sequence, but does not 
comprise the GLG DNA sequence. Restriction sites are 
designed to fall within each framework region so that 
diversity can be cloned into the CDRs. Xmal and Espl are 
in FR1, SexAI is in FR2, RsrII is in FR3, and Kpnl (or 
Acc65I) are in FR4 . Additional sites are provided in the 
constant kappa chain to facilitate construction of the 
gene. 

15 Table 7 also shows a suitable scheme of 

variegation for kappa. In CDR1, the most preferred length 
11 is depicted. However, most preferably both lengths 11 
and 12 are used. Length 12 in CDR1 can be construed by 
introducing codon 51 as <2> (i.e. a Ser-biased mixture). 
CDR2 of kappa is always 7 codons . Table 7 shows a 
preferred variegation scheme for CDR2. Table 7 shows a 
variegation scheme for the most preferred CDR3 (length 9) . 
Similar variegations can be used for CDRs of length 8 and 
10. In the preferred embodiment of this invention, those 
25 three lengths (8, 9 and 10) are included in the libraries 
of this invention in the native ratios, as described above. 

Table 9 shows series of diversity 
oligonucleotides and primers that may be used to construct 
the kappa chain diversities depicted in Table 7. 
30 (ii) Lambda Chain 

(a) Framework 

The lambda chain is preferably built in a 2a2 
framework with an L2J region. These are the most common V 
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and J regions in the native genes. Other frameworks , such 
as 31, 4b, la and 6a, and other J regions, such as L1J, L3J 
and L7J, however, may be used without departing from the 
scope of this invention, 
5 (b) CDR1 

In native human lambda chains, CDRls with length 
14 predominate, lengths 11, 12 and 13 also occur. While 
any of these can be used in this invention, lengths 11 and 
14 are preferred. For length 11 the sequence is: 

10 TG<2><4>L<4><4><4><3><4><4> and for Length 14 the sequence 
is: TG<1>SS<2>VG<1X3X2X3>VS, wherein <1> is 0.27 T, 0.27 
G and 0.027 each of ADEFHIKLMNPQRSVWY; <2> is 0.27 D, 0.27 
N and 0.027 each of AEFGH I KLMPQRSTVW Y ; <3> is 0.36 Y and 
0.0355 each of ADEFGHIKLMNPQRSTVW; and <4> is an equimolar 

15 mixture of amino acid residues ADEFGHIKLMNPQRSTVWY. Most 
preferably, mixtures (similar to those occurring in native 
antibodies) preferably, the ratio is 11 : 14 : : 23 : 46 : : 0 . 33 : 
0.67 of the three lengths are used. 

(c) CDR2 

20 In native human lambda chains, CDR2s with length 

7 are by far the most common. This length is preferred in 
this invention. The sequence of this Length 7 CDR2 is 
<4><4><4><2>RPS, wherein <2> is 0.27 D, 0.27 N, and 0.027 
each of AE FGH I KLM PQRST VW Y and <4> is an equimolar mixture 

25 of amino acid residues ADEFGHIKLMNPQRSTVW. 

(d) CDR3 

In native human lambda chains, CDR3s of length 10 
and 11 predominate, while length 9 is also common. Any of 
these three lengths can be used in the invention. Length 
30 11 is preferred and mixtures of 10 and 11 more preferred. 
The sequence of Length 11 is <4><5><4><2><4>S<4X4><4X4>V, 
where <2> and <4> are as defined for the lambda CDR1 and 
<5> is 0.36 S and 0.0355 each of ADFFGHIKLMNPQRTVWY. The 
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sequence of Length 10 is <5>SY<1X5>S<5><1><4>V, wherein 
<1> is an equimolar mixture of ADEFGHIKLMNPQRSTVWY; and <4> 
and <5> are as defined for Length 11. The preferred 
mixtures of this invention comprise an equimolar mixture of 
5 Length 10 and Length 11. Table 8 shows a preferred focused 
lambda light chain diversity in accordance with this 
invention. 

Table 9 shows a series of diversity 
oligonucleotides and primers that may be used to construct 
10 the lambda chain diversities depicted in Table 7 . 

Method of Construction of the Genetic Package 

The diversities of heavy chain* and the kappa and 
lambda light chains are best constructed in separate 
vectors. First a synthetic gene is designed to embody each 

15 of the synthetic variable domains. The light chains are 
bounded by restriction sites for ApaLI (positioned at the 
very end of the signal sequence) and AscI (positioned afer 
the stop codon) . The heavy chain is bounded by Sfil 
(positioned within the PelB signal sequence) and NotI 

20 (positioned in the linker between CHI and the anchor 

protein) . Signal sequences other than PelB may also need, 
e.g., a M13 pill signal sequence. 

The initial genes are made with "stuffer" 
sequences in place of the desired CDRs. A "Stuffer" is a 

25 sequence that is to be cut away and replaced by diverse DNA 
but which does not allow expression of a functional 
antibody gene. For example, the stuffer may contain 
several stop codons and restriction sites that will not 
occur in the correct finished library vector. For example, 

30 in Table 10, the stuffer for CDR1 of kappa A27 contains a 
StuI site. The vgDNA for CDR1 is introduced as a cassette 
from Espl, Xmal, or Aflll to either SexAI or Kasl. After 
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the ligation, the DNA is cleaved with StuI; there should be 
no StuI sites in the desired vectors. 

The sequences of the heavy chain gene with 
stuff ers is depicted in Table 6. The sequences of the 
5 kappa light chain gene with stuffers is depicted in Table 
10. The sequence of the lambda light chain gene with 
stuffers is depicted in Table 11. 

In another embodiment of the present intention 
the diversities of heavy chain and the kappa or lambda 

10 light chains are constructed in a single vector or genetic 
packages (e.g., for display or display and expression) 
having appropriate restriction sites that allow cloning of 
these chains. The processes to construct such vectors are 
well known and widely used in the art. Preferably, a heavy 

15 chain and Kappa light chain library and a heavy chain and 
lambda light chain library would be prepared separately. 
The two libraries, most preferably, will then be mixed in 
equimolar amounts to attain maximum diversity. 

Most preferably, the display is had on the 

20 surface of a derivative of M13 phage. The most preferred 
vector contains all the genes of M13, an antibiotic 
resistance gene, and the display cassette. The preferred 
vector is provided with restriction sites that allow 
introduction and excision of members of the diverse family 

25 of genes, as cassettes. The preferred vector is stable 
against rearrangement under the growth conditions used to 
amplify phage. 

In another embodiment of this invention, the 
diversity captured by the methods of the present invention 

30 may be displayed and/or expressed in a phagemid vector 

(e.g., pCESl) that displays and/or expresses the peptide, 
polypeptide or protein. Such vectors may also be used to 
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store the diversity for subsequent display and/or 
expression using other vectors or phage. 

In another embodiment of this invention, the 
diversity captured by the methods of the present invention 
5 may be displayed and/or expressed in a yeast vector. 
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I claim: 

1. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a heavy chain CDRl 
selected from the group consisting of: 

(1) <1>iY 2 <1> 3 M 4 <1> 5 , wherein <1> is an 
equimolar mixture of each of amino acid residues A, D, E, 
F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y; 

(2) (S/T) a (S/G/X) 2 (S/G/X) 3 Y 4 Y 5 W 6 (S/G/X) 7 . 
wherein (S/T) is a 1:1 mixture of S and T residues, (S/G/X) 
is a mixture of 0.2025 S, 0.2025 G and 0.035 of each of 
amino acid residues A, D, E, F, H, I, K, L, M, N, P, Q, R, 
T, V, W, and Y; 

( 3 ) V 1 S 2 G 3 G 4 S 5 I 6 S 7 <1> B <1> 9 <1> 10 Y 11 Y 12 W 13 <1> 14 , 
wherein <1> is an equimolar mixture of each of amino acid 
residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 
W, and Y; and 

(4) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences. 

2. The focused library according to claim 1, 
wherein HC CDRls (1), (2) and (3) are present in the 
library in the ratio 0.80:0.17:0.02. 

3. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
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display, display and express, or comprise at least a 
portion of the diversity of the antibody facility, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a heavy chain CDR2 
selected from the group consisting of: 

(1) <2>I<2X3>SGG<1>T<1>YADSVKG, wherein 
<1> is an equimolar mixture of each of amino acid residues 
A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and 
Y; <2> is an equimolar mixture of each of amino acid 
residues Y, R, W, V, G, and S; and <3> is an equimolar 
mixture of each of amino acid residues P, S, and G or an 
equimolar mixture of P and S; 

(2) <!>I<4XlXlXGX5XlXlXl>YADSVKG, 
wherein <1> is an equimolar mixture of each of amino acid 
residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 
W, and Y; <4> is an equimolar mixture of residues D, I, N, 
S, W, Y; and <5> is an equimolar mixture of residues S, G, 
D and N; 

(3) <1>I<4X1X1>G<5X1X1>YNPSLKG, wherein 
<1> is an equimolar mixture of each of amino acid residues 
A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 
and <4> and <5> are as defined above; 

(4) <1>I<8>S<1X1X1>GGYY<1>YAASVKG, 
wherein <1> is an equimolar mixture of each amino acid 
residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T f V f 
W and Y; <8> is 0.27 R and 0.027 of each of 
ADEFGHIKLMNPQSTVWY; and 

(5) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences. 

4. The focused library according to claim 3 
wherein a mixture of HC CDR2s (l)/(2) (equimolar), (3) and 
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(4) are present in the library in a ratio of 
0.54:0.43:0.03. 

5. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a heavy chain CDR3 
selected from the group consisting of: 

(1) YYCA2 111 1 YFD YWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 
G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(2) YYCA2 11111 1 YFD YWG > wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 
G, H, I , K, L, M, N, P, Q, R, S, T, V, w and Y; and 2 is an 
equimolar mixture of K and R; 

(3) YYCA211111111YFDYWG, wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(4) YYCAR111S2S3111YFDYWG, wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 
G, H, I , K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of S and G; and 3 is an equimolar mixture 
of Y and W; 

(5) YYC A2 1 1 1 CSG1 1 C Yl YFD YWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 
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(6) YYCA211SlTIFGlllllYFDyWG / wherein 1 is 
an equimolar mixture of each amino acid residues A, D, E, 

F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is 
an equimolar mixture of K and R; 

(7) YYCAR111YY2S3344111YFDYWG, wherein 1 is 
an equimolar mixture of each amino acid residues A, D, E, 

F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 2 is an 
equimolar mixture of D and G; and 3 is an equimolar mixture 
of S and G; 

(8) YYCAR1 111 YC 2231 C Y 1 1 1 YFD YWG , wherein 1 
is an equimolar mixture of each amino acid residues A, D, 
E, F, G, H, I, K, L, M, N, P, Q, R, S, T f V, W and Y; 2 is 
an equimolar mixture of S and G; and 3 is an equimolar 
mixture of T, D and G; and 

(9) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences. 

6. The focused library according to claim 5, 
wherein 1 in one or all of HC CDR3s (1) through (8) is 
0.095 of each of G and Y and 0.048 of each of A, D, E, F, 
H, I, K, L, M, N, P, Q, R, S, T, V, and W. 

7. The focused library according to claim 5 or 
6, wherein HC CDR3s (1) through (8) are present in the 
library in, the following proportions: 



(1) 


0.10 


(2) 


0.14 


(3) 


0.25 


(4) 


0 . 13 


(5) 


0.13 


(6) 


0.11 


(7) 


0 . 04 and 


(8) 


0.10 
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8. The focused library according to claim 5 or 
6, wherein the HC CDR3s (1) through (8) are present in the 
library in the following proportions: 



(1) 


0.02 


(2) 


0.14 


(3) 


0.25 


(4) 


0.14 


(5) 


0.14 


(6) 


0.12 


(7) 


0.08 and 


(8) 


0.11 



9. A focused library of vectors or genetic 
packages that display , display and express , or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display , display and express , or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encodes a kappa light chain 
CDR1 selected from the group consisting of: 

( 1 ) RASQ<1>V<2><2 ><3>LA 

(2) RASQ<1>V<2><2><2><3>LA; 

wherein <1> is an equimolar mixture of amino acid residues 
ADEFGHIKLMNPQRSTVWY; <2> is 0,2 S and 0,044 of each of 
ADE FGH I KLMN PQRT VW Y ; and <3> is 0.2Y and 0-044 each of 
ADE FGH I KLMN PQRT VW and Y; and 

(3) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences. 

10, The focused library of claim 9, wherein 
CDRls (1) and (2) are present in the library in a ratio of 
0.68:0.32, 
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11. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a kappa light chain 
CDR2 having the sequence: 

<1>AS<2>R<4><1>, 
wherein <1> is an equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRS T VW Y ; <2> is 0.2 S and 0.044 of each of 
ADE FGH I KLMN PQRT VW Y ; and <4> is 0.2 A and ) 0.044 each of 
DE FGH I KLMN PQRST VW Y . 

12. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a kappa light chain 
CDR3 selected from the groups consisting of: 

(1) QQ<3X1X1><1>P<1>T, 

wherein <1> is an equimolar mixture of amino acid residues 
ADEFGHIKLMNPQRSTVWY; <3> is 0.2 Y and 0.044 each of 
ADE FGH I KLMN PQRT VW ; 

(2) QQ33111P, wherein 1 and 3 are as 

defined in (1) above; 
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(3) QQ3211PP1T, wherein 1 and 3 are as 
defined in (1) above and 2 is 0.2 S and 0.04 4 each of 
ADE FGH I KLMN PQRT VWY ; and 

(4) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences. 

13. The focused library according to claim 12, 
wherein CDR3s (1), (2) and (3) are present in the library 
in a ratio of 0 . 65 : 0 . 1 : 0 . 25 . 

14. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 
CDRl selected from the group consisting of: 

(1) TG<1>SS<2>VG<1X3><2><3>VS, 
wherein <1> is 0.27 T, 0.27 G and 0.027 each of 
ADEFHIKLMNPQRSVWY, <2> is 0.27 D, 0.27 N and 0.027 each of 
AEFGHIKLMPQRSTVWY, and <3> is 0.36 Y and 0.036 each of 
ADE FGH I KLMN PQRSTVW ; 

(2) G<2X4>L<4><4><4><3><4><4>, 
wherein <2> is as defined in (1) above and <4> is an 
equimolar mixture of amino acid residues 

ADE FGHI KLMN PQRST VW Y ; and 

(3) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences. 
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15. The focused library according to claim 14, 
where CDRls (1) and (2) are present in the library in a 
ratio of 0.67:0.33. 

16. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 
CDR2 has the sequence: 

<4><4><4><2>RPS, 
wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 
AEFGHIKLMPQRSTVWY and <4> is an equimolar mixture of amino 
acid residues ADEFGHIKLMNPQRSTVW. 

17. A focused library of vectors or genetic 
packages that display, display and express, or comprise a 
member of a diverse family of human antibody related 
peptides, polypeptides and proteins and collectively 
display, display and express, or comprise at least a 
portion of the diversity of the antibody family, the 
vectors or genetic packages being characterized by 
variegated DNA sequences that encode a lambda light chain 
CDR3 selected from the group consisting of: 

(1) <4><5><4><2><4>S<4X4><4><4>V, 
wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 
AEFGHIKLMPQRSTVWY; <4> is an equimolar mixture of amino 
acid residues ADEFGHIKLMNPQRSTVW; and <5> is 0.36 S and 
0.0355 each of ADEFGHIKLMNPQRTVWY; 
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(2) <5>SY<l><5>S<5Xl><4>V, wherein <1> is 
an equimolar mixture of ADE FGH I KLMN PQRS T VW Y ; and <4> and 
<5> are as defined in (1) above; and 

(3) mixtures of vectors or genetic packages 
characterized by any of the above DNA sequences, 

18. The focused library according to claim 17, 
wherein CDR3s (1) and (2) are present in the library in an 
equimolar mixture. 

19. The focused library according to claim 1 or 
2 further comprising variegated DNA sequences that encode a 
heavy chain CDR selected from the group consisting of: 

(1) one or more of the heavy chain CDR2s 
defined in claim 3 or 4; 

(2) one or more of the heavy chain CDR3s 
defined in claims 5, 6, 7, or 8; and 

(3) mixtures of vectors or genetic packages 
characterized by (1) and (2) . 

20. The focused library according to claim 3 
further comprising variegated DNA sequences that encodes 
one or more heavy chain CDR3s selected from the group 
defined in claims 5, 6, 1 or' 8. 

21. The focused library according to claim 19 or 
20, further comprising variegated DNA sequences that 
encodes a light chain CDR selected from the group 
consisting of 

(1) one or more the kappa light chain CDRls 
defined in claim 9 or 10; 

(2) the kappa light chain CDR2 defined in 

claim 11; 
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(3) one or more of the kappa light chain 
CDR3s defined in claim 12 or 13; 

(4) one or more of the kappa light chain 
CDRls defined in claim 14 or 15; 

(5) the lambda light chain CDR2 defined in 

claim 16; 

(6) one or more of the lambda light chain 
CDR3s defined in claim 17 or 18; and 

(7) mixtures of vectors and genetic 
packages characterized by one or more of (1) through (6). 

22. A population of variegated DNA sequences 
that encode a heavy chain CDR1 selected from the group 
consisting of: 

(1) <1> 1 Y 2 <1> 3 M 4 <1> 5 , wherein <1> is an 
equimolar mixture of each of amino acid residues A, D, E, 
F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y; 

(2 ) (S/T) x (S/G/X) 2 (S/G/X) 3 Y 4 Y 5 W 6 (S/G/X) 7 . 
wherein (S/T) is a 1:1 mixture of S and T residues, (S/G/X) 
is a mixture of 0.2025 S, 0.2025 G and 0.035 of each of 
amino acid residues A, D, E, F, H, I, K, L, M, N, P, Q, R, 
T, V, W, and Y; 

(3) V 1 S 2 G 3 G 4 S 5 I 6 S 7 <l> 8 <l>9<l>ioYiiYi2Wi3<l>i4f 
wherein <1> is an equimolar mixture of each of amino acid 

- residues A, D, E, F, G f H, I, K, L, M, N, P, Q, R, S, T, V, 
W, and Y; and 

(4) mixtures of variegated DNA sequences 
characterized by any of the above DNA sequences. 

23. The population of variegated DNA sequences 
according to claim 22, wherein HC CDRls (1), (2) and (3) 
are present in the population in the ratio 0.80:0.17:0.02. 
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24. A population of variegated DNA sequences 
that encode a heavy chain CDR2 selected from the group 
consisting of: 

(1) <2>I<2X3>SGG<1>T<1>YADSVKG, wherein 
<1> is an equimolar mixture of each of amino acid residues 
A, D, E, F, G, H, I, K, L, M, N, P, Q, Rr S, T, V, W, and 
Y; <2> is an equimolar mixture of each of amino acid 
residues Y, R, W, V, G, and S; and <3> is an equimolar 
mixture of each of amino acid residues P, S, and G or an 
equimolar mixture of P and S; 

(2) <l>I<4><l><l><G><5Xl><l><l>YADSVKG f 
wherein <1> is an equimolar mixture of each of amino acid 
residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 
W, and Y; <4> is an equimolar mixture of residues D, I, N, 
S, W, Y; and <5> is an equimolar mixture of residues S, G, 
D and N; 

(3) <l>I<4XlXl>G<5XlXl>YNPSLKG, wherein 
<1> is an equimolar mixture of each of amino acid residues 
A, D, E, F, G, H, I, K f L, M, N, P, Q, R f S, T r V f W and Y; 
and <4> and <5> are as defined above; 

(4) <l>K8>S<lXlXl>GGYY<l>YAASVKG f 
wherein <1> is an equimolar mixture of each amino acid 
residues A, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, 
W and Y; <8> is 0.27 R and 0.027 of each of 
ADEFGHTKLMNPQSTVWY ; and 

(5) mixtures of variegated DNA sequences 
characterized by any of the above DNA sequences. 

25. The population of variegated DNA sequences 
according to claim 24 , wherein a mixture of HC CDR2s 
(l)/(2) (equimolar), (3) and (4) are present in the 
population in a ratio of 0.54:0.43:0.03. 
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26, A population of variegated DNA sequences 
that encode a heavy chain CDR3 selected from the group 
consisting of: 

(1) YYCA2 11 1 1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(2) YYCA2 11111 1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 
G, H, I, K, L, M, N, P, Q/ R# S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(3) YYCA2 1111111 1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q# R, S f T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(4) YYC AR1 1 1 S 2 S 3 1 1 1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of S and G; and 3 is an equimolar mixture 
of Y and W; 

(5) Y YCA2 1 1 1 CSG1 1 CY1 YFDYWG , wherein 1 is an 
equimolar mixture of each amino acid residues A, D, E, F, 

G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is an 
equimolar mixture of K and R; 

(6) YYCA2 11S1TIFG1111 1 YFDYWG , wherein 1 is 
an equimolar mixture of each amino acid residues A, D, E, 

F/ G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; and 2 is 
an equimolar mixture of K and R; 

(7) YYCAR111YY2S3344111YFDYWG, wherein 1 is 
an equimolar mixture of each amino acid residues A, D, E, 

F/ G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 2 is an 
equimolar mixture of D and G; and 3 is an equimolar mixture 
of S and G; 

RECTIFIED SHEET (RULE 91) 
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(8) YYCAR1111YC2231CY111YFDYWG, wherein 1 
is an equimolar mixture of each amino acid residues A, D, 
E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W and Y; 2 is 
an equimolar mixture of S and G; and 3 is an equimolar 
mixture of T, D and G; and 

(9) mixtures of variegated DNA sequences 
characterized by any of the above DNA sequences. 

27. The population of variegated DNA according 
to claim 26, wherein 1 in one or all of HC CDR3s (1) 
through (8) is 0.095 of each of G and Y and 0.048 of each 
of A, D, E, F, H, I, K, L, M, N, P, Q, R, S, T, V, and W. 

28. The population of variegated DNA sequences 
according to claim 26 or 27, wherein HC CDR3s (1) through 
(8) are present in the population in the following 
proportions: 



(1) 


0.10 


(2) 


0.14 


(3) 


0.25 


(4) 


0.13 


(5) 


0.13 


(6) 


0.11 


(7) 


0.04 and 


(8) 


0.10 



29. The population of variegated DNA sequences 
according to claim 26 or 27, wherein the HC CDR3s (1) 
through (8) are present in the population in the following 
proportions : 

(1) 0.02 

(2) 0.14 

(3) 0.25 
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(4) 0.14 

(5) 0.14 

(6) 0.12 

(7) 0.08 and 

(8) 0.11 

30. A population of variegated DNA sequences 
that encode a kappa light chain CDR1 selected from the 
group consisting of: 

{ 1 ) RASQ<1>V<2X2X3>LA 

( 2 ) RASQ<1>V<2X2X2X3>LA; 

wherein <1> is an equimolar mixture of amino acid residues 
ADEFGHIKLMNPQRSTVWY; <2> is 0.2 S and 0.044 of each of 
ADE FGH I KLMN PQRT VW Y ; and <3> is 0.2Y and 0.044 each of 
ADEFGHIKLMNPQRTVW and Y; and 

(3) mixtures of variegated DNA sequences 
characterized by any of the above DNA sequences. 

31. The population of variegated DNA sequences 
of claim 30, wherein CDRls (1) and (2) are present in the 
population in a ratio of 0.68:0.32. 

32. A population of variegated DNA sequences 
that encode a kappa light chain CDR2 having the sequence: 

<1>AS<2>R<4X1>, 
wherein <1> is an equimolar mixture of amino acid residues 
ADEFGHIKLMNPQRSTVWY; <2> is 0.2 S and 0.044 of each of 
ADE FGH I KLMN PQRT VW Y ; and <4> is 0.2 A and ) 0.044 each of 
DEFGHIKLMNPQRSTVWY . 

33. A population of variegated DNA sequences 
that encode a kappa light chain CDR3 selected from the 
groups consisting of: 
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(1) QQ<3X1X1X1>P<1>T, 

wherein <1> is an equimolar mixture of amino acid residues 
ADE FGHI KLMN PQRST VWY ; <3> is 0.2 Y and 0.044 each of 
ADE FGH I KLMN PQRT VW ; 

(2) QQ33111P, wherein 1 and 3 are as 
defined in (1) above; 

(3) QQ3211PP1T, wherein 1 and 3 are as 
defined in (1) above and 2 is 0.2 S and 0.044 each of 
ADEFGHIKLMNPQRTVWY; and 

(4) mixtures of variegated DNA sequences 
characterized by any of the above DNA sequences. 

34. The population of variegated DNA sequences 
according to claim 33, wherein CDR3s (1), (2) and (3) are 
present in the population in a ratio of 0.65:0.1:0.25. 

35. A population of variegated DNA sequences 
that encode a lambda light chain CDRl selected from the 
group consisting of: 

(1) TG<1>SS<2>VG<1><3><2><3>VS, 
wherein <1> is 0.27 T, 0.27 G and 0.027 each of 
ADE FH I KLMN PQRS VW Y , <2> is 0.27 D, 0.27 N and 0.027 each of 
AE FGH I KLMPQRST VW Y , and <3> is 0.36 Y and 0.036 each of 
ADE FGH I KLMN PQRST VW; 

" (2) G<2><4>L<4><4><4><3><4><4>, 

wherein <2> is as defined in (1) above and <4> is an 
equimolar mixture of amino acid residues 
ADE FGH I KLMN PQRS T VW Y ; and 

(3) mixtures of variegated DNA sequences 
characterized by any of the above DNA sequences. 
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36. The population of variegated DNA sequences 
according to claim 35, where CDRls (1) and (2) are present 
in the population in a ratio of 0.67:0.33. 

37. A population of variegated DNA sequences 
that encode a lambda light chain CDR2 has the sequence: 

<4><4><4><2>RPS, 
wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 
AEFGHIKLMPQRSTVWY and <4> is an equimolar mixture of amino 
acid residues ADEFGHIKLMNPQRSTVW. 

38. A population of variegated DNA sequences 
that encode a lambda light chain CDR3 selected from the 
group consisting of: 

(1) <4><5><4><2><4>S<4><4><4X4>V / 
wherein <2> is 0.27 D, 0.27 N, and 0.027 each of 
AEFGHIKLMPQRSTVWY; <4> is an equimolar mixture of amino 
acid residues ADEFGHIKLMNPQRSTVW; and <5> is 0.36 S and 
0.0355 each of ADEFGHIKLMNPQRTVWY; 

(2) <5>SY<1><5>S<5X1><4>V, wherein <1> is 
an equimolar mixture of ADEFGHIKLMNPQRSTVWY; and <4> and 
<5> are as defined in (1) above; and 

(3) mixtures of variegated DNA sequence 
characterized by any of the above DNA sequences. 

39. The population of variegated DNA sequences 
according to claim 38, wherein CDR3s (1) and (2) are 
present in the population in an equimolar mixture. 

40. The population of variegated DNA sequences 
according to claim 22 or 23 further comprising variegated 
DNA sequences that encode a heavy chain CDR selected from 
the group consisting of: 
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(1) one or more of the heavy chain CDR2s 
defined in claim 24 or 25; 

(2) one or more of the heavy chain CDR3s 
defined in claims 26, 27 , 28 or 29; and 

(3) mixtures of variegated DNA sequence 
characterized by (1) and (2) . 

41. The population of variegated DNA sequences 
according to claim 24 further comprising variegated DNA 
sequences that encodes one or more heavy chain CDR3s 
selected from the group defined in claims 26, 27, 28 or 29. 

42. The population of variegated DNA sequences 
according to claim 40 or 41 further comprising variegated 
DNA sequences that encodes a light chain CDR selected from 
the group consisting of 

(1) one or more the kappa light chain CDRls 
defined in claim 30 or 31; 

(2) the kappa light chain CDR2 defined in 

claim 32; 

(3) one or more of the kappa light chain 
CDR3s defined in claim 33 or 34; 

(4) one or more of the kappa light chain 
CDRls defined in claim 35 or 36; 

(5) the lambda light chain CDR2 defined in 

claim 37; 

(6) one or more of the lambda light chain 
CDR3s defined in claim 38 or 39; and 

(7) mixtures of variegated DNA sequences 
characterized by one or more of (1) through (6). 

43. A population of vectors comprising the 
variegated DNA sequences of any one of claims 22-42. 



